Higher layer coding of non-speech like signals using factorial pulse codebook
Abstract
A transform coding method is proposed for coding the higher layers of a multi-layer embedded speech and audio coding system using a factorial pulse codebook. The proposed method uses frequency-selective attenuation of the lower-layer output to reduce the spurious noise generated when a speech-model-based coding method is used in the lower layers to code non-speech signals. The frequency-selective attenuation, together with the factorial pulse codebook, makes the method suitable for coding non-speech-like signals. A classifier that decides whether a signal is speech-like or non-speech-like is also proposed. The proposed method is part of an ITU embedded speech/audio coding standard (ITU-T G.EV-VBR). Formal listening tests confirm the benefits of the proposed method for coding music signals and speech signals with background music.
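As background on the factorial pulse codebook mentioned above, the following minimal sketch counts the code vectors in a generic codebook of this kind: all integer vectors of length n whose absolute values sum to m (m signed unit pulses placed on n positions). The count is the standard combinatorial result; the function names and the example values of n and m are illustrative assumptions and are not taken from the paper or from the G.EV-VBR codec.

```python
from itertools import product
from math import comb


def fpc_codebook_size(n: int, m: int) -> int:
    """Count integer vectors of length n whose absolute values sum to m.

    For each number d of non-zero positions: choose the positions
    (comb(n, d)), split the m unit pulses into d positive magnitudes
    (comb(m - 1, d - 1)), and assign a sign to each position (2 ** d).
    """
    if m == 0:
        return 1  # only the all-zero vector
    return sum(
        comb(n, d) * comb(m - 1, d - 1) * 2 ** d
        for d in range(1, min(n, m) + 1)
    )


def fpc_bits(n: int, m: int) -> int:
    """Bits needed to index every code vector, i.e. ceil(log2(size))."""
    return (fpc_codebook_size(n, m) - 1).bit_length()


def brute_force_size(n: int, m: int) -> int:
    """Enumerate all candidates; only practical for very small n and m."""
    return sum(
        1
        for v in product(range(-m, m + 1), repeat=n)
        if sum(abs(x) for x in v) == m
    )


if __name__ == "__main__":
    # Hypothetical sizes, chosen only to show how the index length grows.
    for n, m in [(3, 2), (8, 4), (16, 6)]:
        print(n, m, fpc_codebook_size(n, m), fpc_bits(n, m))
```

Because the codebook is defined combinatorially, a code vector can be represented by a single integer index of `fpc_bits(n, m)` bits rather than by a stored table, which is the property that makes this kind of codebook attractive for transform coefficients.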
Similar resources
Using Various Types of Excitation Signals
A high-quality speech coding method (SPMEX) at 4.8 kb/s is proposed. The SPMEX selects a suitable excitation signal based on a decision made from the acoustic features of the speech signal in a frame. Improved pitch interpolation multi-pulse (PMPC) excitation is selected for vowel-like speech. In PMPC, the multi-pulse is calculated for only one pitch period in the frame. Further, gain and phase adjusting ...
A 4 kbps adaptive fixed code-excited linear prediction speech coder
In this paper, we propose an adaptive fixed code-excited linear prediction (AF-CELP) speech coder operating at 4 kbps. By exploiting the fact that the fixed codebook contribution to the speech signal is periodic, like the corresponding adaptive codebook contribution, the adaptive fixed codebook model efficiently represents excitation signals. In order to overcome the quality degradation caused by t...
An Efficient Transcoding Scheme for G.729 and G.723.1 Speech Codecs: Interoperability over the Internet
This paper proposes an efficient conversion algorithm for G.729 and G.723.1 speech codecs to reduce computational complexity of the communications between the G.729 and G.723.1 speech codecs. The proposed transcoding method incorporates four processes: line spectral pair (LSP) interpolation, pitch conversion, fast adaptive-codebook search, and fast fixed-codebook search. To reduce search comput...
High quality multi-pulse based CELP speech coding at 6.4 kb/s and its subjective evaluation
This paper proposes an MP-CELP (Multi-Pulse-based CELP) speech coding method at 6.4 kb/s with a 10 ms frame. In MP-CELP, the amplitudes or signs of the multi-pulse excitation are simultaneously vector quantized (VQ). A combination search between multiple pulse location candidates and the VQ codebook remarkably improves the quantization performance. In order to improve speech quality for background noise conditions,...
On Improving the Performance of an ACELP Speech Coder
In this paper we evaluate the performance of a variety of techniques to improve the parameter analysis in CELP speech coders. These methods include using an extended cost horizon in the fixed codebook search process, as well as joint optimization and delayed decision coding of the adaptive and fixed codebook parameters. Based on our simulations for the IS-641 speech coder, substantial improvements...